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DESCRIPTION 

SURFACTANT TOLERANT CELLULASE AND METHOD FOR MODIFICATION 
THEREOF 

TECHNICAL FIELD 
[0001] 

The present invention relates to a method for 
modifying the N-terminus of a protein having an 
endoglucanase activity (particularly, cellulase belonging to 
family 45 and having an endoglucanase activity ) to 
pyroglutamic acid, to convert the protein into cellulase 
having an endoglucanase activity whose reduction in the 
presence of a surfactant is small, and relates to the 
cellulase . 



BACKGROUND ART 
[0002] 

Cellulose biomass is said to be the most abundant 
resource in natural resources, and thus an efficient 
application of cellulase systems which decompose the 
cellulose biomass is desired in various fields. In this 
development process, various cellulases were purified and 
characterized, and further, various cellulase genes were 
cloned, and classified into families by analyzing the 
sequence homology (see non-patent reference 1) . 

In another aspect, cellulases are utilized, based on 
their properties, in various industrial fields, particularly 
the field of fabric processing. For example, treatment with 
cellulase is carried out to improve the touch and/or 
appearance of cellulose-containing fabric, or for a 
"biowash", which imparts a "stonewash" appearance to colored 
cellulose-containing fabric, thereby providing the fabric 
with localized color variations. Further, in the process 
for manufacturing lyocell, cellulase is used for removing 
the fuzz generated in the process from the fabric surface. 
In this connection, lyocell is a regenerated cellulose 
fabric derived from wood pulp, and has recently attracted 
attention for its properties (such as high strength or water 
absorption) and as a production process that causes less 
environmental pollution. 
[0003] 
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Hitherto, it has been considered that cellulase 
decomposes cellulose by the collaborative effect of plural 
enzymes, i.e., synergy effect. The cellulase group 
consisting of plural enzymes contains enzymes having 
properties inappropriate for the field of fabric processing 
(such as an enzyme which lowers a fiber strength) . 
Therefore, an attempt to separate enzyme components 
appropriate for fabric processing from the cellulase group, 
and to produce the enzyme components, has been carried out 
by utilizing protein separation techniques and/or genetic 
engineering techniques. Particularly, cellulases derived 
from microorganisms belonging to filamentous fungi such as 
genus Trichoderma or genus Humdcola have been subjected to 
serious study. For example, as cellulase components, CBH I, 
EG V (see patent reference 1), NCE2, NCE4, and NCE5 in genus 
Humicola, and CBH I, CBH II, EG II, and EG III in genus 
Trichoderma were isolated, and thus, cellulase preparations 
containing as the major components one or more specific 
cellulase components appropriate for each purpose can be 
produced by preparing overexpressed enzymes or monocomponent 
enzymes using genetic engineering techniques. Further, it 
is clarified that cellulases belonging to family 45, such as 
NCE4 (see patent reference 2), NCE5 (see patent reference 
3), RCE1 (see patent reference 4), or STCE1 (see 
International Application No. PCT/JP2004/15733) , are very 
useful in the above fields. 
[0004] 

In still another aspect, when cellulases are used as a 
detergent for clothing, not only quantitative improvement of 
cellulase components used but also qualitative one is 
desired. More particularly, a detergent for clothing 
contains various surfactants, and a solution obtained by 
solubilizing the detergent for clothing in water is alkaline 
(pHIO to pHll) . Therefore, it is necessary that cellulases 
contained in a detergent for clothing should be resistant to 
various surfactants under alkaline conditions. As a report 
in which a reduction in an activity in the presence of a 
surfactant is suppressed, it was reported by Otzen, D.E. et 
al. that when a mutation was introduced into the internal 
amino acid sequence of Cel45 derived from Humicola insolens, 
the activity thereof at pH7 in the presence of linear alkyl 
benzene sulfonate (LAS) was approximately 3.3 times higher 
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than that of the wild-type (see non-patent reference 2) . 
However, it is found that the suppression of a reduction in 
the activity in the presence of the surfactant, provided by 
the mutation, is limited to Cel45 or homologous proteins 
thereof, and that is not applicable to endoglucanases 
belonging to family 45 having a low homology with Cel45. 
[0005] 

(patent reference 1) International Publication W091/17243 
(patent reference 2) International Publication WO98/03667 
(patent reference 3) International Publication WO01/90375 
(patent reference 4) International Publication WO00/24879 
(non-patent reference 1) Henrissat B., Bairoch A. Updating 
the sequence-based classification of glycosyl hydrolases. 
Biochem. J. 316: 695-696 ( 1996) 

(non-patent reference 2) Daniel E. Otzen, Lars Christiansen, 
Martin Schulein. A comparative study of the unfolding of the 
endoglucanase Cel45 from Humicola insolens in denaturant and 
surfactant. Protein Sci. 8:1878-1887(1999) 

DISCLOSURE OF INVENTION 

PROBLEMS TO BE SOLVED BY THE INVENTION 
[0006] 

An object of the present invention is to provide a 
method for converting a protein having an endoglucanase 
activity (particularly, a protein belonging to family 45 and 
having an endoglucanase activity) to a protein having an 
endoglucanase activity whose reduction in the presence of a 
surfactant is small; a vector used in the method; a protein 
having an endoglucanase activity whose reduction in the 
presence of a surfactant is small; and a polynucleotide 
encoding the same. A further object of the present 
invention is to provide, using the above, a microorganism 
which efficiently produces a useful protein as an enzyme for 
the washing of clothing. 

MEANS FOR SOLVING THE PROBLEMS 
[0007] 

The present inventors conducted intensive studies, and 
as a result, found that proteins in which pyroglutamic acid 
(hereinafter sometimes referred to as pQ) or a peptide 
containing pQ was added to the N-terminus of each protein 
belonging to family 45 and having an endoglucanase, i.e., N- 
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terminus-added cellulases, had an endoglucanase activity 
exhibiting no significant reduction in the activity in the 
presence of a surfactant, in comparison with wild-type 
cellulases . 
[0008] 

The feature that a high endoglucanase activity is 
maintained in the presence of an anionic surfactant is 
particularly useful as enzymes for the washing of clothing, 
but such a cellulase is not known. Further, it has not been 
reported that functions inherent in an enzyme can be 
maintained in the presence of a surfactant by adding thereto 
pyroglutamic acid or a peptide containing the same. 
[0009] 

In another aspect of the present invention, with 
respect to all cellulases in which the N-terminal amino acid 
is not protected and the maintenance of the activity in the 
presence of a surfactant is desired, it is possible, by 
adding pyroglutamic acid or a peptide containing the same to 
the cellulase, to suppress a reduction in the activity in 
the presence of a surfactant. The cellulase which 
pyroglutamic acid or a peptide containing the same are added 
is not particularly limited, but cellulases belonging to 
family 45 are preferable. 
[0010] 

Accordingly, the present invention includes the 
following : 

(1) a method for suppressing a reduction in an endoglucanase 
activity in the presence of a surfactant, characterized by 
modifying a protein having the endoglucanase activity in 
which the N-terminus is an amino acid other than 
pyroglutamic acid, to a protein having the N-terminus of 
pyroglutamic acid; 

(2) the method of (1), wherein the modification is carried 
out by adding pyroglutamic acid or an amino acid convertible 
into pyroglutamic acid, or a peptide having the N-terminus 
of pyroglutamic acid or an amino acid convertible into 
pyroglutamic acid, to the N-terminus of the protein having 
the endoglucanase activity in which the N-terminus is an 
amino acid other than pyroglutamic acid; 

(3) the method of (1), wherein the modification is carried 
out by substituting pyroglutamic acid or an amino acid 
convertible into pyroglutamic acid, or a peptide having the 
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N-terminus of pyroglutamic acid or an amino acid convertible 
into pyroglutamic acid, for the N-terminal amino acid or an 
N-terminal region of the protein having the endoglucanase 
activity in which the N-terminus is an amino acid other than 
pyroglutamic acid; 

(4) the method of any one of (1) to (3), wherein the protein 
having the endoglucanase activity in which the N-terminus is 
an amino acid other than pyroglutamic acid is a cellulase 
belonging to family 45; 

(5) a modified protein having an endoglucanase activity 
wherein the N-terminal amino acid is converted into 
pyroglutamic acid by an amino acid modification; 

(6) the modified protein of (5), which is obtainable by the 
method of any one of (1) to (4); 

(7) a protein selected from the group consisting of: 

(a) a protein comprising the amino acid sequence of SEQ ID 
NO: 2, 4, 38, or 40; 

(b) a modified protein comprising an amino acid sequence in 
which one or plural amino acids are deleted, substituted, 
inserted, or added in the amino acid sequence of SEQ ID NO: 
2, 4, 38, or 40, and having an endoglucanase activity whose 
reduction in the presence of a surfactant is small; and 

(c) a homologous protein comprising an amino acid sequence 
having at least 85% homology with a protein comprising the 
amino acid sequence of SEQ ID NO: 2, 4, 38, or 40, and 
having an endoglucanase activity whose reduction in the . 
presence of a surfactant is small; 

(8) a polynucleotide encoding the protein of any one of (5) 
to (7); 

(9) a polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of 
SEQ ID NO: 1, 3, 37, or 39; 

(b) a polynucleotide comprising a nucleotide sequence in 
which one or plural nucleotides are deleted, substituted, 
inserted, or added in the nucleotide sequence of SEQ ID NO: 
1, 3, 37, or 39, and encoding a protein having an 
endoglucanase activity whose reduction in the presence of a 
surfactant is small; and 

(c) a polynucleotide hybridizing under stringent conditions 
to a polynucleotide consisting of the nucleotide sequence of 
SEQ ID NO: 1, 3, 37, or 39, and encoding a protein having an 
endoglucanase act ivity whose reduction in the presence of a 
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surfactant is small; 

(10) an expression vector comprising the polynucleotide of 
(8) or (9); 

(11) a host cell transformed with the expression vector of 
(10) ; 

(12) the host cell of (11), wherein the host cell is a yeast 
or filamentous fungus; 

(13) the host cell of (12), the filamentous fungus is a 
microorganism belonging to genus Humicola or Trichoderma ; 

(14) the host cell of (13), the filamentous fungus is 
Humicola insolens or Trichoderma viride; 

(15) a process for producing the protein of any one of (5) 
to (7), comprising: 

cultivating the host cell of any one of (11) to (14), and 
recovering the protein from the host cell or culture 
obtained by the cultivation; and 

(16) a protein produced by the process of (15) . 

EFFECTS OF THE INVENTION 
[0011] 

According to the present invention, it is possible to 
efficiently produce a novel cellulase which is useful as an 
enzyme for the washing of clothing and has an endoglucanase 
activity whose reduction in the presence of a surfactant is 
small . 

BEST MODE FOR CARRYING OUT THE INVENTION 
[0012] 

Protein belonging to family 45 and having endoglucanase 
activity (hereinafter sometimes referred to as "cellulase 
belonging to family 45") 
Family 4 5 

The term "family 45" as used herein means a protein 
classified into family 45 in accordance with the hydrophobic 
cluster analysis of carbohydrate activating enzymes by B. 
Henrissat and A. Bairoch [Henrissat B., Bairoch A. Updating 
the sequence-based classification of glycosyl hydrolases. 
Biochem.J. 316: 695-696 (1996)]. 
[0013] 

Protein having endoglucanase activity 

The term "protein having an endoglucanase activity" as 
used herein means an enzyme exhibiting an endoglucanase 
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activity, i.e., endo-1, 4-0-glucanase (EC3.2.1.4), which 
hydrolyzes the (3-1 , 4-glucopyranosyl bond in (J-l, 4-glucan . 
[0014] 
Surfactant 

The term ^surfactant 7 ' as used herein is a detergent 
component contained in a detergent for clothing, and broadly 
classified into anionic, cationic, and nonionic surfactants. 
Anionic surfactants are commonly used. As preferable 
anionic surfactants used in the present invention, there may 
be mentioned, for example, linear alkyl benzene sulfonate 

(hereinafter sometimes referred to as LAS) . 

[0015] 

Endoglucanase activity 

The term "endoglucanase activity (hereinafter referred 
to as "EGU")" as used herein is defined as an enzyme 
activity obtained by measuring a decrease in the viscosity 
of a carboxymethylcellulose solution in accordance with the 
following procedure. 

As a substrate solution, carboxymethylcellulose 
(Hercules) was dissolved in a 0.1 mol/L Tris-HCl buffer 
(pHlO.O) (final concentration = 3.5%). To the substrate 
solution (5 mL) previously heated at 40°C for 10 minutes, 
0.15 mL of an enzyme solution was added, and then the whole 
was mixed well to carry out the reaction at 40°C for 30 
minutes. The viscosity of the reaction mixture was measured 
by an R type viscometer (RE100; TOKI SANGYO CO., LTD.) at 
40 °C. The "1 unit" of the enzyme activity is defined as an 
amount of enzyme which lowers the initial viscosity to 1/2, 
in each reaction condition. As an anionic surfactant, 
linear alkyl benzene sulfonate (Wako Pure Chemical 
Industries, Co., Ltd.) was used, and added to the 
carboxymethylcellulose solution to a final concentration of 
800 ppm. 
[0016] 

Suppression of reduction in endoglucanase activity in the 
presence of surfactant 

"Having an endoglucanase activity whose reduction in 
the presence of a surfactant is small" as used herein means 
that, when the protein in which the N-terminus thereof is 
modified (N-terminus-modif ication-type protein) according to 
the present invention is compared to the original protein 
before performing the modification (hereinafter simply 
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referred to as the original protein) with respect to the 
endoglucanase activity in the presence of the surfactant, 
the endoglucanase activity of the N-terminus-modif icat ion- 
type protein is higher than that of the original protein. 
[0017] 

Original protein 

The original protein which may be applied to the 
method of the present invention is not particularly limited, 
so long as it is a protein having an endoglucanase activity 
in which the N-terminus is an amino acid other than ■ 
pyroglutamic acid. As the original protein, there may be 
mentioned, for example, a cellulase (for example, 
endoglucanase, cellobiohydrolase, or (3-gulucosidase) , and a 
cellulase belonging to family 45 is preferable. In this 
connection, so long as the original protein has at least an 
endoglucanase activity, it may be a protein having only the 
endoglucanase activity, or a protein having one or more 
other enzyme activities in addition to the endoglucanase 
activity. Further, the. original protein may be a naturally 
occurring protein or a genetically modified protein. 
[0018] 

Original source of cellulase belonging to family 45 

The cellulase belonging to family 45 may be generated 
by commonly used genetic engineering techniques, such as 
recombinant DNA techniques or polypeptide synthesis 
techniques, or may be obtained from an isolated wild-type, 
strain. Further the cellulases include variants of wild- 
type cellulases belonging to family 45. 

The cellulase belonging to family 45 may be obtained 
from a microorganism such as filamentous fungus or 
zygomycetes. As the filamentous fungus, there may be 
mentioned, for example, microorganisms belonging to genus 
Humicola (such as Humicola insolens) , genus Trichoderma 
(such as Trichoderma viride) , genus Staphylotrichum (such as 
Staphylotrichum coccosporum) , or genus Myriococcum (such as 
Myriococcum thermophilum) . More particularly, cellulases 
derived from genus Humicola include, for example, CBH I, EG 
V, NCE2, NCE4, and NCE5; cellulases derived from genus 
Trichoderma include, for example, CBH I, CBH II, EG II, and 
EG III; cellulases derived from genus Staphylotrichum 
include, for example, STCE1 and STCE3; and cellulase derived 
from genus Myriococcum include, for example, MTE1. As the 
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zygomycetes, there may be mentioned, for example, 
microorganisms belonging to genus Rhizopus (such as Rhizopus 
oryzae) , genus Mucor (such as Mucor circinelloides) , or 
genus Phycomyces (such as Phycomyces nitens) . More 
particularly, there may be mentioned, for example, RCE I, 
RCE II, or RCE III derived from Rhizopus oryzae; MCE I or 
MCE II derived from Mucor circinelloides; or PCE I derived 
from Phycomyces nitens (WO00/2487 9) . 
[0019] 

Pyroglutamic acid or peptide containing pyroqlutamic acid 

The term "pyroglutamic acid" as used herein means 
pyroglutamic acid generated by cyclization of N-terminal 
glutamine or glutamic acid of a mature protein. 
Pyroglutamic acid has the feature that the N-terminal amino 
group is not exposed. Pyroglutamate formation can be 
performed in vivo or in vitro. In vivo, a polynucleotide 
encoding a modified protein, which is genetically designed 
so that the N-terminus of a mature protein is glutamine or 
glutamic acid, may be expressed in a host cell to obtain a 
pyroglutamate cyclation protein. In vitro, a protein having 
the N-terminus of glutamine or glutamic acid may be treated 
with an acidic solution such as formic acid to obtain a 
protein having the N-terminus of pyroglutamic acid. 
[0020] 

The term "peptide" as used herein means a compound 
consisting of one or plural amino acids in which the amino 
acids are polymerized by peptide bonds. Therefore, the term 
"peptide containing pyroglutamic acid" as used herein means 
a peptide in which the N-terminal amino acid is pyroglutamic 
acid. The peptide containing pyroglutamic acid consists of 
two or more (plural) crosslinked amino acids, for example, 2 
to 40 amino acids, preferably 2 to 30 amino acids, more 
preferably 2 to 20 amino acids, still further preferably 2 
to 10, still further preferably 2 to 5, most preferably 2 to 
4 amino acids. The amino acids are not particularly 
limited, so long as they can be used by those skilled in the 
art for the stated purpose. 
[0021] 

In the method of the present invention, a method for 
modifying an original protein to a protein having the N- 
terminus of pyroglutamic acid is not particularly limited, 
so long as the protein modification can be performed. As 
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the method, there may be mentioned, for example, genetic 

engineering techniques or chemical techniques. 

[0022] 

According to the genetic engineering techniques, an 
original protein can be modified, for example, by carrying 
out a genetically engineered addition and/or substitution of 
an appropriate amino acid or amino acid sequence. 

In an embodiment utilizing the genetically engineered 
addition, the protein modification can be carried out by 
genetically adding pyroglutamic acid or a peptide having the 
N-terminus of pyroglutamic acid, to the N-terminus of an 
original protein (i.e., a protein having an endoglucanase 
activity in which the N-terminus is not pyroglutamic acid) . 

More particularly, the embodiment utilizing the 
genetically engineered addition may comprise, for example, 
the steps of: 

(1) adding a polynucleotide encoding an amino acid 
convertible into pyroglutamic acid (such as glutamic acid or 
glutamine) or a polynucleotide encoding a peptide having the 
N-terminus of an amino acid convertible into pyroglutamic 
acid, to the 5' terminus of a polynucleotide encoding an 
original protein; and 

(2) expressing the resulting polynucleotide in a host in 
which pyroglutamate formation of the N-terminal amino acid 
can be performed. 

[0023] 

In an embodiment utilizing the genetically engineered 
substitution, the protein modification can be carried out by 
genetically substituting pyroglutamic acid or a peptide 
having the N-terminus of pyroglutamic acid, for the N- 
terminus amino acid or an N-terminal region of an original 
protein . 

More particularly, the embodiment utilizing the 
genetically engineered substitution may comprise, for 
example, the steps of: 

(1) substituting a polynucleotide encoding an amino acid 
convertible into pyroglutamic acid (such as glutamic acid or 
glutamine) or a polynucleotide encoding a peptide having the 
N-terminus of an amino acid convertible into pyroglutamic 
acid, for the 5' terminus or a region containing the same of 
a polynucleotide encoding an original protein; and 

(2) expressing the resulting polynucleotide in a host in 
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which pyroglutamate formation of the N-terminal amino acid 

can be performed. 

[0024] 

As embodiments utilizing the chemical techniques, 
there may be mentioned, for example, 

(a) an embodiment in which pyroglutamic acid (or a peptide 
having the N-terminus of pyroglutamic acid) is directly 
added to the N-terminus of an original protein; 

(b) an embodiment in which an amino acid convertible into 
pyroglutamic acid (or a peptide having the N-terminus of an 
amino acid convertible into pyroglutamic acid) is chemically 
added to the N-terminus of an original protein, and then 
pyroglutamate formation of the N-terminal amino acid is 
performed chemically; or 

(c) an embodiment in which the N-terminal amino acid of an 
original protein having the N-terminus of an amino acid 
convertible into pyroglutamic acid is chemically converted 
to pyroglutamic acid. 

[0025] 

Method for adding pyroglutamic acid or peptide containing 
pyroglutamic acid to the N-terminal side of cellulase 
belonging to family 45 

The modification method will be further illustrated by 
an embodiment using cellulase belonging to family 45. A 
method for adding pyroglutamic acid or a peptide containing 
pyroglutamic acid to the N-terminal side of cellulase 
belonging to family 45 may be carried out by genetic 
engineering techniques. In a commonly used cellulase 
production, a coding region of a polynucleotide encoding 
desired cellulase may be operably linked between a promoter 
and a terminator functionable in a host such as a 
filamentous fungus, and then the resulting expression 
cassette may be introduced into the host. Further, a 
polynucleotide encoding a signal sequence for secretion 
functionable in the host cell may be added to the cassette. 
When the cassette is introduced into the host cell, the 
desired cellulase is secreted into a medium, and then can be 
easily collected. In this case, a desired amino acid can be 
added to the N-terminus of cellulase, by adding a 
polynucleotide encoding the amino acid immediately 
downstream from the signal sequence for secretion. Further, 
the modification of the amino group in the N-terminal amino 
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acid may be carried out by utilizing a signal sequence for 
secretion in a host. For example, in cbhl or cbh2 derived 
from Trichoderma viride, or NCE2 or NCE5 derived from 
Humicola insolens , the N-terminus is formed pyroglutamate , 
the modification can be carried out by using these signal 
sequences for secretion and expressing in Trichoderma viride 
or Humicola insolens . According to the preferable 
embodiment, modified cellulase belonging to family 45 
prepared as described above exhibits an advantageous 
feature, i.e., it has an endoglucanase activity whose 
reduction in the presence of a surfactant is small. 
[0026] 

According to another embodiment, the whole can be 
chemically synthesized, within the scope of technical common 
knowledge of those skilled in the art. In this case, the 
synthesis may be carried out using part of a naturally- 
occurring protein. 
[0027] 

Protein of the present invention 

The protein of the present invention is prepared by 
modifying the N-terminus of an original protein having an 
endoglucanase activity to pyroglutamic acid [for example, 
prepared by obtaining cellulase belonging to family 45 and 
adding pyroglutamic acid (pQ) or a peptide containing 
pyroglutamic acid to the N-terminal amino acid side of the 
mature protein thereof] , and has an endoglucanase activity 
whose reduction in the presence of a surfactant is small. 
Further, the present invention includes a protein which may 
be prepared by the above-described method and has an 
endoglucanase activity whose reduction in the presence of a 
surfactant is small. 
[0028] 

More particularly, the protein of the present 
invention includes a protein selected from the group 
consisting of the following proteins: 

(a) a protein comprising the amino acid sequence of SEQ ID 
NO: 2, 4, 38, or 40; 

(b) a modified protein comprising an amino acid sequence in 
which one or plural amino acids are deleted, substituted, 
inserted, or added in the amino acid sequence of SEQ ID NO: 
2, 4, 38, or 40, and having an endoglucanase activity whose 
reduction in the presence of a surfactant is small; and 
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(c) a homologous protein comprising an amino acid sequence 
having at least 85% homology with a protein comprising the 
amino acid sequence of SEQ ID NO: 2, 4, 38, or 40, and 
having an endoglucanase activity whose reduction in the 
presence of a surfactant is small. 

[0029] 

The amino acid sequence of SEQ ID NO: 2 is the amino 
acid sequence of N-terminus-modif ied NCE4 (see Example A2) 
in which a peptide consisting of five amino acids (N- 
terminus: pyroglutamic acid) is added to the N-terminus of 
endoglucanase NCE4 derived from Humicola insolens MN200-1. 

The amino acid sequence of SEQ ID NO: 4 is the amino 
acid sequence of N-terminus-modif ied STCE1 (see Example B2) 
in which a peptide consisting of four amino acids (N- 
terminus: pyroglutamic acid) is substituted for the N- 
terminal amino acid (Ala) of endoglucanase STCE1 derived 
from Staphylotrichum coccosporum IFO 31817. 

The amino acid sequence of SEQ ID NO: 38 is the amino 
acid sequence of N-terminus-modif ied STCE1 (see Example B3) 
in which pyroglutamic acid is added to the N-terminus of 
endoglucanase STCE1 derived from Staphylotrichum coccosporum 
IFO 31817. 

The amino acid sequence of SEQ ID NO: 40 is the amino 
acid sequence of N-terminus-modif ied STCE1 (see Example B4) 
in which a peptide consisting of four amino acids (N- 
terminus: pyroglutamic acid) is added to the N-terminus of 
endoglucanase STCE1 derived from Staphylotrichum coccosporum 
IFO 31817. 
[0030] 

The term "modified protein" as used herein means a 
protein comprising an amino acid sequence in which one or 
plural amino acids (for example, one or several amino acids) 
are deleted, substituted, inserted, or added in the amino 
acid sequence of SEQ ID NO: 2, 4, 38, or 40, and having an 
endoglucanase activity whose reduction in the presence of a 
surfactant is small. The number of amino acids to be 
modified, such as "deleted, substituted, inserted, or 
added", is preferably 1 to 30, more preferably 1 to 10, most 
preferably 1 to 6. 
[0031] 

Further, the modified protein includes a protein 
comprising an amino acid sequence in which one or plural 
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amino acids are conservatively substituted in the amino acid 
sequence of SEQ ID NO: 2, 4, 38, or 40, and having an 
endoglucanase activity whose reduction in the presence of a 
surfactant is small. The term "conservative substitution" 
as used herein means that one or plural amino acid residues 
contained in a protein are replaced with different amino 
acids having similar chemical properties so that the 
activities of the protein are not substantially changed. As 
the conservative substitution, there may be mentioned, for 
example, a substitution of a hydrophobic amino acid residue 
for another hydrophobic amino acid residue, or a 
substitution of a polar amino acid residue for another polar 
amino acid residue having the same charge. Amino acids 
which have similar chemical properties and can be 
conservatively substituted with each other are known to 
those skilled in the art. More particularly, as nonpolar 
(hydrophobic) amino acids, there may be mentioned, for 
example, alanine, valine, isoleucine, leucine, proline, 
tryptophan, phenylalanine, or methionine. As polar 
(neutral) amino acids, there may be mentioned, for example, 
glycine, serine, threonine, tyrosine, glutamine, asparagine, 
or cysteine. As basic amino acids having a positive charge, 
there may be mentioned, for example, arginine, histidine, or 
lysine. As acidic amino acids having a negative charge, 
there may be mentioned, for example, aspartic acid or 
glutamic acid. 
[0032] 

The term "homologous protein" as used herein means a 
protein comprising an amino acid sequence having at least 
85% (preferably 90% or more, most preferably 95% or more) 
homology (sequence identity) with a protein comprising the 
amino acid sequence of SEQ ID NO: 2, 4, 38, or 40, and 
having an endoglucanase activity whose reduction in the 
presence of a surfactant is small. The homology as used 
herein is shown as the value (identity) calculated by FASTA3 
[Science, 227, 1435-1441(1985); Proc . Natl . Acad . Sci . USA, 85, 
24 4 4-24 4 8 (1988) ; http://www.ddbj . nig . ac . j p/E-mail /homology- 
j.html], a known homology search program, in accordance with 
default parameters . 
[0033] 

As described above, in the "protein consisting of the 
amino acid sequence of SEQ ID NO: 2" of the present 
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invention, the portion except the N-terminal peptide 
consisting of five amino acids is derived from endoglucanase 
NCE4 . Further, in the "protein consisting of the amino acid 
sequence of SEQ ID NO: 4, 38, or 40" of the present 
invention, the portion except the N-terminal amino acid or 
peptide is derived from endoglucanase STCE1. Endoglucanases 
NCE4 and STCE1 belong to family 45. As known endoglucanase 
belonging to family 45, there may be mentioned, for example, 
NCE5 derived from genus Humicola (WO01/90375) . 
[0034] 

In Figures 1 and 2, an alignment of the amino acid 
sequences of endoglucanase STCE1 [signal peptide (SEQ ID NO: 
43) and mature protein (SEQ ID NO: 44)], endoglucanase NCE4 
[signal peptide (SEQ ID NO: 45) and mature protein (SEQ ID 
NO: 46)], and endoglucanase NCE5 [signal peptide (SEQ ID NO: 
47) and mature protein (SEQ ID NO: 48)] is shown. 

Figure 1 shows the alignment of the N-terminal half, 
and Figure 2 shows that of the C-terminal half. The symbol 

i n Figures 1 and 2 indicates an amino acid common to 
that in STCE1. 
[0035] 

As shown in Figures 1 and 2, endoglucanases belonging 
to family 45 contain the catalytic domain (1st to 207th) as 
a common domain, and sometimes contain the Linker region 
(208th to 258th) and/or the cellulose-binding domain (CBD) 
(259th to 295th). In this connection, the numbers in 
parentheses after the above domains represent the amino acid 
numbers in the amino acid sequence (SEQ ID NO: 44) of 
endoglucanase STCE1 . 
[0036] 

In each region, there are many conservative amino 
acids between or among endoglucanases in the catalytic 
domain and the cellulose-binding domain, but no remarkable 
conservative region is observed in the Linker region. 
Regions containing many conservative amino acids (for 
example, the catalytic domain or cellulose-binding domain, 
particularly the catalytic domain) , or common amino acids 
contained in the regions are considered as important regions 
or amino acids for the enzyme activity of endoglucanases 
(such as STCE1 or NCE4). Therefore, when an amino acid 
modification (for example, deletion, substitution, 
insertion, and/or addition, particularly conservative 
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substitution) is carried out in a region or amino acid other 
than such important regions or amino acids, a modified or 
homologous protein maintaining the enzyme activity can be 
obtained with a high possibility, without undue experiment. 
[0037] 

Further, even if in the regions containing many 
conservative amino acids, a modification of noncommon amino 
acid(s) between or among endoglucanases to different amino 
acid(S) [preferably amino acid(s) which are similar and can 
be conservatively substituted] may probably maintain the 
enzyme activity. Therefore, by such a modification, a 
modified or homologous protein maintaining the enzyme 
activity can be obtained with a high possibility, without 
undue experiment. 

In this connection, even if common amino acid(s) in 
the region containing many conservative amino acids are 
modified to different amino acid(s), the enzyme activity is 
sometimes maintained. Particularly, in a modification to 
amino acid(s) which are similar and can be conservatively 
substituted, the possibility is increased. The modified or 
homologous protein of the present invention includes a 
protein in which one or more amino acids contained in any 
region, such as the catalytic domain, Linker region, or 
cellulose-binding domain, are modified, so long as it 
exhibits an endoglucanase activity. 
[0038] 

Polynucleotide encoding the protein of the present invention 

According to the present invention, a polynucleotide 
encoding a protein comprising the amino acid sequence of SEQ 
ID NO: 2, 4, 38, or 40, or a modified or homologous protein 
thereof (hereinafter collectively referred to as the protein 
of the present invention) may be provided. When the amino 
acid sequence of a protein is given, a nucleotide sequence 
encoding the amino acid sequence can be easily selected, and 
thus various nucleotide sequences encoding the protein of 
the present invention can be selected. The term 
"polynucleotide" as used herein includes DNA and RNA, and 
DNA is preferable. 
[0039] 

The polynucleotide of the present invention includes a 
polynucleotide selected from the group consisting of the 
following polynucleotides: 
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(a) a polynucleotide comprising the nucleotide sequence of 
SEQ ID NO: 1, 3, 37, or 39; 

(b) a polynucleotide comprising a nucleotide sequence in 
which one or plural nucleotides are deleted, substituted, 
inserted, or added in the nucleotide sequence of SEQ ID NO: 
1, 3, 37, or 39, and encoding a protein having an 
endoglucanase activity whose reduction in the presence of a 
surfactant is small; and 

(c) a polynucleotide hybridizing under stringent conditions 
to a polynucleotide consisting of the nucleotide sequence of 
SEQ ID NO: 1, 3, 37, or 39, and encoding a protein having an 
endoglucanase activity whose reduction in the presence of a 
surfactant is small. 

[0040] 

In the nucleotide sequence described in the above item 
(b) , the number of nucleotides to be deleted, substituted, 
inserted, or added is, for example, 1 to 90, preferably 1 to 
30, more preferably 1 to 18, most preferably 1 to 9 . 
[0041] 

The term "under stringent conditions" as used herein 
means the following conditions. In accordance with a 
protocol attached to an ECL direct DNA/RNA labeling and 
detection system (Amersham) , after a polynucleotide to be 
tested is prehybridized at 42 °C for an hour, a 
polynucleotide consisting of the nucleotide sequence of SEQ 
ID NO: 1, 3, 37, or 39 is added, and hybridization is 
carried out at 42°C for 14 to 16 hours. After the 
hybridization, a washing treatment with 0.5*SSC (1><SSC; 15 
mmol/L sodium citrate, 150mmol/L sodium chloride) containing 
0.4% SDS and 6 mol/L urea at 42 °C for 20 minutes is repeated 
twice, and a washing treatment with 5*SSC at room 
temperature for 5 minutes is carried out twice. 
[0042] 

The polynucleotide of the present invention includes a 
naturally-occurring polynucleotide. Further, the whole can 
be synthesized. Furthermore, the synthesis may be carried 
out using part of the naturally-occurring polynucleotide. 
Typically, the polynucleotide of the present invention may 
be obtained by screening a genomic library derived from a 
desired microorganism in accordance with an ordinary method 
commonly used in genetic engineering, for example, using an 
appropriate DNA probe designed on the basis of information 
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of a partial amino acid sequence. 
[0043] 

Production of the protein of the present invention 

The protein of the present invention can be produced 
in host cells by transforming the host cell with a 
polynucleotide molecule (particularly as a form of 
expression vector) comprising a polynucleotide fragment 
encoding the protein so that the polynucleotide molecule may 
be replicated and the gene may be expressed in the host 
cell. 
[0044] 

According to the present invention, an expression 
vector comprising a polynucleotide fragment encoding the 
protein of the present invention so that the polynucleotide 
fragment may be replicated and the protein may be expressed 
in a host microorganism, is provided. 
[0045] 

The expression vector of the present invention can be 
constructed on the basis of a self -replicating vector (such 
as a plasmid) , which exists as an extrachromosomal element 
and can replicate independently of the replication of 
chromosomes. Alternatively, the expression vector of the 
present invention may be a vector which is integrated into 
the chromosome of the host microorganism and replicated 
together with chromosomes, when the host is transformed with 
the vector. The construction of the vector of the present 
invention can be carried out by ordinary procedures or 
methods commonly used in genetic engineering. 
[0046] 

To express a protein having a desired activity by 
transforming a host microorganism with the expression vector 
of the present invention, it is preferable that the 
expression vector contains, for example, a polynucleotide 
capable of controlling the expression, or a genetic marker 
to select transf ormants, in addition to the polynucleotide 
fragment of the present invention. 
[0047] 

As the polynucleotide capable of regulating the gene 
expression, various signals for the transcription or 
translation regulation, such as a promoter, a terminator, or 
a polynucleotide encoding a signal peptide for secretion, 
may be used in the present invention. The ligation of these 
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polynucleotides and the insertion thereof to a vector can be 

carried out by an ordinary method. 

[0048] 

The promoter which can be used in the present 
invention is not particularly limited, so long as it shows a 
transcriptional activity in a host microorganism. The 
promoter can be obtained as a polynucleotide which regulates 
the expression of a gene encoding a protein the same as or 
different from that derived from the host microorganism. 
For example, a promoter such a lactose operon or a 
tryptophan operon can be used in Escherichia coli; a 
promoter of an alcohol dehydrogenase gene, an acid 
phosphatase gene, a galactose utilization gene, or a 
glyceraldehyde 3-phosphate dehydrogenase gene can be used in 
a yeast; and a promoter of an a-amylase gene, a glucoamylase 
gene, a cellobiohydrolase gene, or a glyceraldehyde 3- 
phosphate dehydrogenase gene can be used in a filamentous 
fungus . 
[0049] 

The signal peptide is not particularly limited, so 
long as it contributes to the protein secretion in a host 
microorganism. The signal peptide can be obtained as a 
polynucleotide derived from a gene encoding a protein same 
as or different from that derived from the host 
microorganism. 
[0050] 

The selectable marker can be appropriately selected in 
accordance with the method for selecting a transf ormant . As 
the selectable marker, for example, a drug resistance gene 
or a gene complementing an auxotrophic mutation can be used 
in the present invention. When a host is a bacterium, for 
example, an ampicillin resistance gene, a kanamycin 
resistance gene, or a tetracycline resistance gene can be 
used. When a host is a yeast, for example, a tryptophan 
biosynthesis gene (trpl, trpC) , an uracil biosynthesis gene 
{ura3) , a leucine biosynthesis gene (leu2) , or a histidine 
biosynthesis gene (his3) can be used. When a host is a 
filamentous fungus, for example, a destomycin resistance 
gene, a nitrate utilization gene (niaD) , an arginine 
biosynthesis gene (argS) , an uracil biosynthesis gene 
(pyr4) , a hygromycin resistance gene, a bialaphos resistance 
gene, a bleomycin resistance gene, or an aureobasidin 
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resistance gene can be used. 
[0051] 

According to the present invention, a microorganism 
transformed with the expression vector is provided. A host- 
vector system which can be used in the present invention is 
not particularly limited. For example, a system utilizing 
E. coli, Actinomycetes, yeasts, or filamentous fungi, or a 
system for the expression of a fusion protein using such a 
microorganism can be used. When a yeast is used as a host, 
there may be mentioned, for example, a microorganism 
belonging to genus Saccharomyces , Hansenula, or Pichia, 
preferably Saccharomyces cerevisiae. When a filamentous 
fungus is used as a host, any filamentous fungus, preferably 
a microorganism belonging to genus Humicola, Trichoderma , ' 
Aspergillus, Acremonium, or Fusarium, more preferably a 
microorganism belonging to genus Humicola or Trichoderma , 
may be used. More particularly, Humicola insolens, Humicola 
thermoidea, Trichoderma viride, Trichoderma reesei, 
Tri choderma 1 on gibra ch latum, Staphyl otrich urn coccosporum , 
Aspergillus niger, Aspergillus oryzae, Fusarium oxysporum, 
or Acremonium cellulolyticus , preferably Humicola insolens 
or Trichoderma viride, may be used. 
[0052] 

Transformation of a microorganism with the expression 
vector can be carried out in accordance with an ordinary 
method such as a calcium ion method, a lithium ion method, 
an electroporation method, a PEG method, an Agrobacterium 
method, or a particle gun method, and an appropriate method 
can be selected in accordance with a host to be transformed. 
[0053] 

In the present invention, the transformant of the 
present invention is cultured, and the resulting culture is 
used to obtain the protein of the present invention of 
interest. The transformant of the present invention can be 
cultured in accordance with an ordinary method by 
appropriately selecting, for example, a medium or culture 
conditions . 
[0054] 

The medium can be supplemented with, for example, 
carbon sources or nitrogen sources which can be anabolized 
or utilized, respectively, by the transformant of the 
present invention, inorganic salts, various vitamins, 
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various amino acids such as glutamic acid or asparagine, 
trace nutrients such as nucleotides, or selective agents 
such as antibiotics. Further, organic and/or inorganic 
substances which help the growth of the transformant of the 
present invention or promote the production of the protein 
of the present invention can be appropriately added. 
Further, if necessary, a natural medium or synthetic medium 
which appropriately contains other nutrients can be used. 
[0055] 

Any kind of carbon source and nitrogen source can be 
used in the medium as long as it can be utilized by the 
transformant of the present invention. As the anabolizable 
carbon source, various carbohydrates, for example, sucrose, 
starch, glycerin, glucose, fructose, maltose, lactose, 
cellulose, cellobiose, dextrin, animal oils, or plant oils, 
or hydrolysates thereof, can be used. Generally, the 
concentration thereof to the medium is preferably 0.1% to 
5%. As the utilizable nitrogen source) for example, animal 
or plant components or extracts thereof, such as peptone, 
meat extract, corn steep liquor, or defatted soybean powder, 
organic acid ammonium salts such as succinic acid ammonium 
salts or tartaric acid ammonium salts, urea, or other 
various nitrogen-containing compounds such as inorganic or 
organic acids, can be used. As the inorganic salts, for 
example, those which can produce sodium, potassium, calcium, 
magnesium, cobalt, chlorine, phosphoric acid, sulfuric acid, 
or other ions can be appropriately used. 
[0056] 

Any medium which contains other components, for 
example, cells, exudates, or extracts of microorganisms such 
as yeast, or fine plant powders, can be appropriately used, 
as long as they don't interfere with the growth of the 
transformant of the present invention and with the 
production and accumulation of the protein of the present 
invention. When a mutant strain having a nutritional 
requirement is cultured, a substance to satisfy its 
nutritional requirement is generally added to the medium. 
In this connection, such a nutrient may not necessarily be 
added when a medium containing natural substances is used. 
[0057] 

The culture conditions, such as the medium component, 
medium fluidity, incubation temperature, agitation speed, or 
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aeration rate, can be appropriately selected and controlled 
in accordance with the transformant to be used and the 
exterior conditions so as to obtain preferable results. If 
foaming occurs in a liquid medium, a defoaming agent such as 
silicone oils, vegetable oils, mineral oils, or surfactants 
can be appropriately used. 
[0058] 

The protein of the present invention accumulated in 
the resulting culture is contained in the transformant of 
the present invention and the culture filtrate. 
Accordingly, it is possible to recover the protein of the 
present invention from both culture filtrate and 
transformant cells by separating the culture into each 
fraction by centrif ugation . 
[0059] 

The protein of the present invention can be recovered 
from the culture filtrate according to an ordinary method. 
Procedures for recovering the protein of the present 
invention from the culture can be carried out singly, in 
combination in a certain order, or repeatedly. For example, 
extraction filtration, centrif ugation, dialysis, 
concentration, drying, freezing, adsorption, desorption, 
means for separation based on the difference in solubility 
in various solvents (such as precipitation, salting out, 
crystallization, recrystallization, reverse solution, or 
chromatography) can be used. 
[0060] 

Further, the protein of the present invention can be 
obtained from the culture inside the transformant of the 
present invention. For example, extraction from the culture 

(for example, grinding treatment or disruption by pressure) , 
recovery (for example, filtration or centrif ugation) , or 
purification (for example, salting out or solvent 
precipitation) can be carried out using an ordinary method. 

[0061] 

The resulting crude substance can be purified 
according to an ordinary method, for example, by a column 
chromatography using a carrier such as dextran, cellulose, 
agarose, synthetic polymers, or silica gel. A pure protein 
of the present invention of interest can be obtained from 
the culture of the transformant of the present invention 
using the above-mentioned methods, either singly or in an 



(23) 



appropriate combination . 
[0062] 

As described above, according to another embodiment of 
the present invention, the process for producing the novel 
protein of the present invention can be provided. 
Cultivation of the transformant (including culturing 
conditions) can be carried out in a fashion substantially 
similar to that of the original host used to prepare the 
transformant. As the method for recovering the protein of 
interest after the cultivation of the transformant, commonly 
used procedures can be carried out. 
[0063] 

Deposited strains 

Escherichia coli JM109 transformed with expression 
vector pEGDOl of the present invention ( FERM BP-5973) was 
domestically deposited in the International Patent Organism 
Depositary National Institute of Advanced Industrial Science 
and Technology [ (Former Name) National Institute of 
Bioscience and Human-Technology Agency of Industrial Science 
and Technology (Address: AIST Tsukuba Central 6, 1-1, 
Higashi 1-chome Tukuba-shi, Ibaraki-ken 305-8566 Japan) ] on 
July 12, 1996, and was transferred to an international 
deposit on June 13, 1997. The international deposit number 
(a number in parenthesis [] following the international 
deposit number is a domestic deposit number) is FERM BP-5973 
[FERM P-15729] . 
[0064] 

Escherichia coli JM109 transformed with expression 
vector pMKDOl of the present invention (FERM BP-5974) was 
domestically deposited in the International Patent Organism 
Depositary National Institute of Advanced Industrial Science 
and Technology [(Former Name) National Institute of 
Bioscience and Human-Technology Agency of Industrial Science 
and Technology (Address: AIST Tsukuba Central 6, 1-1, 
Higashi 1-chome Tukuba-shi, Ibaraki-ken 305-8566 Japan)] on 
July 12, 1996, and was transferred to an international 
deposit on June 13, 1997. The international deposit number 
(a number in parenthesis [] following the international 
deposit number is a domestic deposit number) is FERM BP-5974 
[FERM P-15730] . 
[0065] 

Escherichia coli JM109 transformed with expression 
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vector pCBl-M2XR of the present invention (FERM BP-6045) was 
domestically deposited in the International Patent Organism 
Depositary National Institute of Advanced Industrial Science 
and Technology [(Former Name) National Institute of 
Bioscience and Human-Technology Agency of Industrial Science 
and Technology (Address: AIST Tsukuba Central 6, 1-1, 
Higashi 1-chome Tukuba-shi, Ibaraki-ken 305-8566 Japan) ] on 
September 9, 1996, and was transferred to an international 
deposit on August 11, 1997. The international deposit 
number (a number in parenthesis [] following the 
international deposit number is a domestic deposit number) 
is FERM BP-6045 [FERM P-15840] . 

In this connection, plasmid pCBl-M2 of the present 
invention can be obtained, not only by the method described 
in EXAMPLES, but also by self -ligation of the plasmid pCBl- 
M2XR previously digested with restriction enzyme Xbal. 
[0066] 

Humicola insolens MN200-1 (FERM BP-5977), which may be 
used as a host for the expression vector of the present 
invention, was domestically deposited in the International 
Patent Organism Depositary National Institute of Advanced 
Industrial Science and Technology [(Former Name) National 
Institute of Bioscience and Human-Technology Agency of 
Industrial Science and Technology (Address: AIST Tsukuba 
Central 6, 1-1, Higashi 1-chome Tukuba-shi, Ibaraki-ken 305- 
8566 Japan) ] on July 15, 1996, and was transferred to an 
international deposit on June 13, 1997. The international 
deposit number (a number in parenthesis [] following the 
international deposit number is a domestic deposit number) 
is FERM BP-5977 [FERM P-15736] . 
[0067] 

Trichoderma viride MC300-1 (FERM BP-6047), which may 
be used as a host for the expression vector of the present 
invention, was domestically deposited in the International 
Patent Organism Depositary National Institute of Advanced 
Industrial Science and Technology [(Former Name) National 
Institute of Bioscience and Human-Technology Agency of 
Industrial Science and Technology (Address: AIST Tsukuba 
Central 6, 1-1, Higashi 1-chome Tukuba-shi, Ibaraki-ken 305- 
8566 Japan) ] on September 9, 1996, and was transferred to an 
international deposit on August 11, 1997. The international 
deposit number (a number in parenthesis [] following the 
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international deposit number is a domestic deposit number) 

is FERM BP-6047 [FERM P-15842] . 

[0068] 

Escherichia coli DH5a transformed with expression 
vector pUC118-STCEex of the present invention (FERM BP- 
10127) was domestically deposited in the International 
Patent Organism Depositary National Institute of Advanced 
Industrial Science and Technology (Address: AIST Tsukuba 
Central 6, 1-1, Higashi 1-chome Tukuba-shi, Ibaraki-ken 305- 
8566 Japan) on December 1, 2003, and was transferred to an 
international deposit on September 15, 2004. The 
international deposit number (a number in parenthesis [] 
following the international deposit number is a domestic 
deposit number) is FERM BP-10127 [FERM P-19602] . In this 
connection, plasmid pUC118-STCEex is a plasmid obtained by 
inserting the STCE gene into the BamHI site of plasmid 
pUC118. The endoglucanase STCE1 gene present in the BamHI 
fragment contains an intron and has the sequence of SEQ ID 
NO: 5. 

EXAMPLES 
[0069] 

The present invention now will be further illustrated 
by, but is by no means limited to, the following Examples. 
[0070] 

Example Al : Construction of expression vectors for 
expressing N-terminus-modif ied cellulase and wild-type 
cellulase in Humicola insolens 

(1) Construction of expression vector pMKDOl for expressing 
N-terminus-modif ied cellulase 

Plasmid pUC118 (Takara Bio) was digested with BamHI . 
The resulting fragment was blunted with a DNA blunting kit 
(Takara Bio) , and self-ligated with a DNA ligation kit 
(Takara Bio) to obtain pUC118-BN. The pUC118-BN was 
digested with SphI, blunted, and self-ligated to obtain 
pUC118BSN. Next, a 3.4-kb Pstl-Xbal fragment comprising the 
cellulase NCE2 gene was obtained from Humicola insolens 
MN200-1 (FERM BP-5977) in accordance with the method 
described in Japanese Unexamined Patent Publication (Kokai) 
No. 8-126492. The fragment was ligated with pUC118BSN 
previously digested with the same restriction enzyme to 
obtain pUC118BSN-PX . Site-directed mutation was introduced 
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into pUC118BSN-PX with a Sculptor in vitro mutagenesis 
system (Amersham Bioscience) to obtain plasmid pM21. As 
oligonucleotides for mutagenesis, oligonucleotides MNC-02 
and MNC-03 were used. 
[0071] 

MNC-02: 5 ' -GAGCGCCAGAACTGTGGATCCACTTGGTGAGCAATG- 3 ' (SEQ NO: 
6) 

MNC-03: 5 ' -TCCGCCGTTCTGAGCGGATCCAGGCGTTTGGCGCG- 3 ' (SEQ NO: 
7) 

[0072] 

The plasmid pM21 was digested with BamHI, and ligated 
with a Ea^HI-digested PCR fragment of a cellobiohydrolase 
gene (NCE3) derived from Humicola insolens MN200-1 to obtain 
plasmid pKM04. The PCR fragment of the NCE3 was amplified 
using genomic DNA (WO98/03640) from Humicola insolens MN200- 
1 (FERM BP-5977) as a template and the following primers 
MKA-05 and MKA-06. 
[0073] 

MKA-05: 5 ' -GCCGCCCAGCAGGCGGGATCCCTCACCACCGAGAGG-3 ' (SEQ NO: 
8) 

MKA-06: 5 ' -TGATCGTCGAGTCAGGGATCCAGAATTTACAGGCAC-3 ' (SEQ NO: 
9) 

[0074] 

Next, a promoter and a terminator (Mullaney , E.J. 
et.al., Mol. Gen. Genet. , 199, 37-45, 1985) of the trpC gene 
derived from Aspergillus nidulans were used to prepare a 
gene capable of expressing a destomycin resistance gene in 
Humicola insolens . The resulting gene was ligated with 
plasmid pKM04 previously digested with Xbal to construct 
expression vector pMKDOl (FERM BP-5974) for expressing N- 
terminus-modif ied cellulase. In pMKDOl , the BamHI 
recognition sites were introduced at the 10 bp downstream 
from the N-terminus of the mature protein of the NCE2 gene 
and at the 3 bp downstream from the stop codon thereof, and 
thus five amino acid residues can be added to the N-terminus 
of a desired cellulase belonging to family 45. 
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[0075] 
Table 1 

Bam HI Bam HI 

5'- GAGCGCCAGAACTG TGGATCC CTC TGCCTGTAAg cggatcc ag g-3' ( SEQ NO: 10) 

Gl uAr gGl n AsnCy sGl ySerLeu CysLeu (SE Q NO: 1X > 

fN-terminus of mature protein 

[0076] 

(2) Construction of expression vector pJDOl for expressing 
wild- type cellulase 

Site-directed mutation was introduced, using 
oligonucleotide pMN-Bam, into plasmid pM21 obtained in the 
above item (1) to obtain plasmid pM21-m-Al. 

[0077] 
pMN-Bam: 

5 ' -GGTCAAACAAGTCTGTGCGGATCCTGGGACAAGATGGCCAAGTTCTTCCTTAC- 3 ' 

(SEQ NO: 12) 

[0078] 

Plasmid pM21-m-Al was digested with restriction 

enzymes Hindlll and BamHI , and the resulting fragment of 

approximately 1 kb was recovered. Next, pMKDOl obtained in 

Example Al(l) was digested with Jfindlll and BamHI , and a 

fragment of approximately 7 kb was recovered. These 

fragments were ligated to obtain plasmid pJDOl. In this 

vector, the BamHI recognition sites were introduced at the 

15 bp upstream from the translation initiation site of the 

NCE2 gene and at the 3 bp downstream from the stop codon 

thereof, and thus a desired cellulase belonging to family 45 

was expressed from the transcription initiation point. 

[0079] 
Table 2 

Bam HI Bam HI 

5'-tg cgga tec tgggacaagATGGCC CCGTTCTGAg c ggatcc ag g~3' (SEQ NO: 13) 

MetAla ProPhe (SEQ NO: 14) 

[0080] 

Example A2 : Construction of expression vectors for 
expressing wild-type NCE4 and N-terminus-modif ied NCE4 , 
transformation of Humicola insolens therewith, and 
evaluation of activities 
(1) Construction of expression vectors 

As an expression vector for wild-type NCE4, the coding 
region of the NCE4 gene was ligated with pJDOl obtained in 
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Example Al(2) to construct pN2EX-N4 . 

First, the coding region of the NCE4 gene was 
amplified by carrying out PCR using genomic DNA from 
Humicola insolens MN200-1 as a template and NCE4 -Ne and 
NCE4-Ce as primers. The resulting PCR product of 
approximately 0.9 kbp was digested with BamHl, and ligated 
with pJDOl previously digested with the same restriction 
enzyme to construct pN2EX-N4 . 
[0081] 

NCE4-Ne: 5 ' -GGGGGGATCCTGGGACAAGATGCGTTCCTCCCCTCTC-3 ' (SEQ NO: 
15) 

NCE4-Ce: 5 ' -GGGGGGATCCCTGCGTTTACAGGCACTGATGG- 3 ' (SEQ NO: 16) 
[0082] 

As an expression vector for N-terminus-modif ied NCE4 , 
the coding region of the mature protein in which the signal 
sequence for secretion was deleted in the NCE4 gene was 
ligated with pMKDOl ( FERM BP-5974) obtained in Example Al ( 1 ) 
to construct pEGDOl (FERM BP-5973) . First, the coding 
region of the mature protein part of the NCE4 gene was 
amplified by carrying out PCR using genomic DNA from 
Humicola insolens MN200-1 as a template and NCE4-Ns and 
NCE4 -Cs as primers. The resulting PCR product of 
approximately 0.8 kbp was digested with BamHl, and ligated 
with pMKDOl previously digested with the same restriction 
enzyme to construct pEGDOl. 
[0083] 

NCE4-Ns: 5 ' -CCGGTGTTGGCCGGATCCGCTGATGGCAAG- 3 ' (SEQ NO: 17) 
NCE4 -Cs : 5 ' -TAAGGCCCTCAAGGATCCCTGCGTCTACAG- 3 ' (SEQ NO: 18) 
[0084] 

(2) Transformant containing wild-type NCE4 or N-terminus- 
modified NCE4 

Humicola insolens MN200-1 was cultured in an S medium 
(3.0% glucose, 2.0% yeast extract, 0.1% polypeptone, 0.03% 
calcium chloride, and 0.03% magnesium sulfate; pH6.8) at 
37 °C for 24 hours. The resulting mycelia were collected by 
centrif ugation, washed with 0.5 mol/L sucrose, and suspended 
in 0.5 mol/L sucrose solution containing 3 mg/mL (5- 
glucuronidase (Sigma) , 1 mg/mL chitinase, and 1 mg/mL 
zymolyase 20T (Seikagaku Corp.). The suspension was gently 
shaken at 30 °C for 60 to 90 minutes to generate protoplasts. 
The resulting protoplasts were collected by filtration and 
centrif ugation, washed with an SUTC buffer (0.5 mol/L 
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sucrose, 10 mmol/L calcium chloride, and 10 mmol/L Tris-HCl; 
pH7.5), and resuspended in SUTC to prepare a protoplast 
suspension . 

To 100 \iL> of the protoplast suspension, 10 of each 
DNA solution (pEGDOl or pN2EX-N4) was added. After each was 
allowed to stand on ice for 5 minutes, 400 of a PEG 
solution (60% polyethylene glycol 4000, 10 mmol/L calcium 
chloride, and 10 mmol/L Tris-HCl; pH7.5) was added, and the 
whole was allowed to stand on ice for 20 minutes. After the 
resulting protoplast suspension was washed with the SUTC 
buffer, a YMG medium (1% glucose, 0.4% yeast extract, 0.2% 
malt extract, and 1% agar; pH6.8) containing 200 }jg/mL 
hygromycin B was overlaid with the protoplast suspension 
together with YMG soft agar. The plate was incubated at 
37 °C for 5 days to obtain transf ormants as colonies. 
[0085] 

(3) Comparison of endoglucanase activities of wild-type NCE4 

and N-terminus-modif ied NCE4 in culture supernatants 

The resulting transf ormants were cultured in 

accordance with the method described in WO98/03667, to 

obtain culture supernatants. The culture supernatants were 

subjected to SDS-PAGE, to select culture supernatants in 

which the NCE4 band of approximately 43 kDa was clearly 

detected. The endoglucanase activities in the culture 

supernatants at pHIO in the absence and presence of LAS were 

measured, and the degrees of a reduction in the activity in 

the presence of LAS were compared. The results are shown in 

Table 3. In Table 3, the endoglucanase activities are 

represented by percentages when each endoglucanase activity 

at pHIO is regarded as 100. 

[0086] 
Table 3 
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[0087] 

As a result, the culture supernatant obtained from the 
N-terminus-modif ied NCE4 transformant exhibited a small 
reduction in the activity in the presence of LAS. Upon a 
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comparison of the endoglucanase activities in the presence 
of LAS, it was found that the endoglucanase activity of N- 
terminus-modif ied NCE4 was 2,1 times higher than that of 
wild-type NCE4 . Further, the endoglucanase activity of 
Humicola insolens MN200-1 accounted for approximately 4% of 
the transf ormant , and thus it was found that almost all of 
the endoglucanase activities were derived from the 
recombinant enzymes expressed by the incorporated expression 
vectors . 
[0088] 

(4) Analysis of N-terminal amino acid sequences of wild-type 
NCE4 and N-terminus-modif ied NCE4 

Culture supernatants of the above transf ormants were 
subjected to SDS-PAGE, and the electrophoresed proteins were 
electrically transferred on a PVDF membrane ( Immobilon-PSQ; 
Millipore) . The NCE4 bands of approximately 43 kDa were 
excised from the blot, and subjected to Protein Sequencer 
Model 492 (Applied Biosystems) to analyze the amino acid 
sequences . 

[0089] 

As a result, it was confirmed that the sequence of 
wild-type NCE4 obtained from the pN2EX-N4 transformant was 
consistent with the DNA sequence of NCE4 derived from 
Humicola insolens MN200-1 described in WO98/03667. 

However, the amino acid sequence of N-terminus- 
modified NCE4 obtained from the pEGDOl transformant could 
not be determined in the above procedure. A Pfu 
pyroglutamate aminopeptidase (Takara Bio) was used to remove 
the modified N-terminus from N-terminus-modif ied NCE4, and 
then the amino acid was determined (SEQ NO: 2) . 

The results were summarized in Table 4, to compare the 
N-terminal amino acids and the N-terminal amino acid 
sequences in wild-type NCE4 and N-terminus-added NCE4, and 
it was confirmed from the results that each NCE4 had the N- 
terminal amino acid sequence expected from the corresponding 
constructed expression vector. 
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[0090] 
Table 4 





N-terminal 


N-terminal amino 




amino acid 


acid sequence 


Wild-type NCE4 


Alanine 


ADGKSTR 




(unmodified) 


(SEQ NO: 19) 


N-terminus- 


Pyroglutamic acid 


pQNCGSADGKSTR 


modified NCE4 


(modified) 


(SEQ NO: 20) 



[0091] 

Example Bl: Construction of expression vectors for 
Trichoderma viride for expressing N-terminus-modif ied 
cellulase and wild-type cellulase 
(1) Construction of expression vectors 

From a PstI fragment of 7 kb containing the cbhl gene 
derived from Trichoderma viride MC300-1 (obtained in 
accordance with W098/11239), a Hindi I I fragment of 
approximately 3.1 kb containing a cbhl promoter region, and 
a Sail fragment of 2.7 kb containing a cbhl terminator 
region were cloned into plasmid pUC119 to construct plasmid 
pCBl-H4 and plasmid pCBl-Sl, respectively. E. coli JM109 
containing each resulting plasmid was infected with helper 
phage M13K07 to obtain each single-stranded DNA. To the 
resulting single-stranded DNAs of plasmids pCBl-H4 and pCBl- 
Sl as templates, phosphorylated oligonucleotides CBn-Stu and 
CBc-Xho were annealed, and mutations were introduced by a 
Sculptor in vitro mutagenesis system (Amersham Bioscience), 
to construct plasmids pCBlH4-19 and pCBlSl-17, respectively. 
An approximately 6 kb fragment, which had been obtained by 
digesting plasmid pCBlH4-19 with Xbal and Xhol , was ligated 
with an approximately 1.2 kb fragment, which had been 
obtained by digesting plasmid pCBlSl-17 with Xbal and then 
partially digesting it with Xhol, to construct pCBl-M. 
[0092] 

CBn-Stu: 5 ' -GATACATGATGCGCAGGCCTTAGTCGACTAGAATGC-3 ' (SEQ NO: 
21) 

CBc-Xho: 5' -GATCCTCAAGCTTTTGCTCGAGTACCTTACAAGCAC-3 ' (SEQ NO: 
22) 

[0093] 

Next, an approximately 2.7 kb fragment obtained by 
digesting pCBl-M with Sail was cloned into plasmid pUC119 to 
construct plasmid pCBl-SalM. To a single-strand DNA 
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obtained from plasmid pCBl-SalM, phosphorylated 
oligonucleotides CBl-SmSph, CBl-Bam, and CBl-Pst were 
annealed, and mutations were introduced by the Sculptor in 
vitro mutagenesis system. 
[0094] 

CBl-SmSph : 5 ' -GGAGGGTGCATGCCGACTGAGCCCGGGCAGTAGCC-3 ' ( SEQ NO : 
23) 

CBl-Bam: 5 ' -GCCGGGAGAGGATCCAGTGGAGG-3 ' (SEQ NO: 24) 
CBl-Pst: 5 ' -GCTCGAGTACCTTACTGCAGGCACTGAGAG-3 ' (SEQ NO: 25) 
[0095] 

Next, plasmid pUC118 was digested with Xbal and EcoRI, 
blunted with the DNA blunting kit, and self-ligated, to 
construct plasmid pUC118-SBN. To pUC118-SBN digested with 
Hindi I I and Sail, the cbhl promoter fragment of 
approximately 1.4 kb previously digested with the same 
restriction enzymes was inserted. The resulting plasmid was 
digested with Sail, and ligated with the mutations- 
introduced cbhl coding region and terminator region of 
approximately 2.8 kb, to construct pCBl-M2 . The introduced 
mutations (excluding the BamHI mutation) are shown in Table 
5. 

[0096] 
Table 5 

Sa 1 I S tu I 

5' — c t a&tcgac t aa_g£cc_t gc gc a t cATGTATCAAAAGTTGGCXXTI^^ (seq no: 26) 

MetTyrGlnLysLeuAl aLeuI 1 eSe rAl aPheLeuAl aThr ( seq no: 27) 

Sma I Sph l Ps t I Xho I Hin d I I I (seq no: 28) 

5~<JC^^^ClX^AGlXXJQ^KrCCACC CAGTIXX XnXX^G TAAg g t ac tcgag ca aaagc t t gag-3'(s E Q no: 29) 

AlaArgAlaGJnSerAlaCysThr GlnCysLeuGln 

fN-terminus of mature protein 

[0097] 

Example B2 : Construction of expression vectors for 
expressing N-terminus-modif ied STCE1 and wild-type STCE1, 
transformation of Trichoderma viride therewith, and 
evaluation of activities 
[0098] 

(1) Construction of expression vectors 

As an expression vector for wild-type STCE1, the 
coding region of the STCE1 gene was ligated with plasmid 
pCBl-M2 obtained in Example Bl to construct pCB-Ste. 
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First, the coding region of the STCE1 gene was 
amplified by carrying out PCR using plasmid pUC118-STCEex 
( FERM BP-10127) as a template and the combination of primers 
STCE1-TNERV and STCE1-TCET22 I . The resulting PCR product of 
approximately 1 kbp was digested with EcoRV and £coT22l, and 
ligated with pCBl-M2 previously digested with Stul and PstI' 
to construct pCB-Ste. 
[0099] 

STCE1-TNERV: GGGGATATCGCGCATCATGCGTTCCTCCCCCGTCCTC (SEQ NO: 
30) 

STCE1-TCET22 I : GGGATGCATTTAAAGGCACTGCGAGTACCAGTC (SEQ NO: 31) 
[0100] 

As an expression vector for N-terminus-modif ied STCE1, 
the coding region of the mature protein in which the signal 
sequence for secretion was deleted in the STCE1 gene was 
ligated with pCBl-M2 obtained in Example Bl to construct 
pCB-Sts . 

First, the coding region of the mature protein of the 
STCE1 gene was amplified by carrying out PCR using plasmid 
pUC118-STCEex (FERM BP-10127) as a template and STCEl-TmNSph 
and STCE1-TCET22 I as primers. The resulting PCR product of 
approximately 1 kbp was digested with SphI and EcoT22l, and 
ligated with pCBl-M2 previously digested with SphI and PstI 
to construct pCB-Sts. 
[0101] 

STCEl-TmNSph: GGGGCATGCGATGGCAAGTCGACCCGCTAC (SEQ NO: 32) 
[0102] 

(2) Generation of Trichoderma viride strain 2 

A suspension (approximately 10 9 CFU/mL) of spores of 
Trichoderma viride MC300-1 (FERM BP-6047) was irradiated 
with UV from two UV lamps located at a height of 30 cm 
thereabove while gently shaking. The suspension was spread 
on a selection medium, and cultured at 28 °C for 7 days. The 
grown strains were selected to obtain Trichoderma viride 
strain 2 showing uracil requirement. 

In this connection, the selection medium was a minimal 
medium [0.2% potassium dihydrogenphosphate, 0.4% ammonium 
sulfate, 0.03% urea, 0.03% magnesium sulfate heptahydrate, 
0.03% calcium chloride, 0.5% glucose, 2.5% agar, and 0.01% 
trace elements (prepared by dissolving 5mg iron sulfate 
heptahydrate, 1.56 mg manganese sulfate heptahydrate, 1 . 4mg 
zinc sulfate heptahydrate, and 2mg cobalt chloride in 1 L of 
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water) ] supplemented with 10 jjg/mL uridine and 1 mg/mL 5- 

f luoroorotic acid . 

[0103] 

(3) Transf ormant containing wild-type STCE1 or N-terminus- 
modified STCE1 

Trichoderma viride was transformed with pCB-Ste or 
pCB-Sts obtained in Example B2(l), by carrying out a co- 
transformation method using Trichoderma viride strain 2 
obtained in Example B2(2) and marker plasmid pPYR4 
containing the pyr4 gene derived from Neurospora crassa, to 
obtain transf ormants as strains grown in the minimal medium. 

First, Trichoderma viride strain 2 was inoculated in a 
200 mL conical flask containing 50 mL of a mycelium 
formation medium [1% yeast extract, 1% malt extract, 2% 
polypeptone, 2.5% glucose, 0.1% dipotassium 

hydrogenphosphate, 0.05% magnesium sulfate heptahydrate, and 
0.0001% uridine (pH7.0)], and cultured at 28°C for 2 days. 
The resulting cells were collected by centrif ugation, washed 
with 0.5 mol/L sucrose, and suspended in 0.5 mol/L sucrose 
solution containing 3 mg/mL (^-glucuronidase (Sigma), 1 mg/mL 
chitinase, and 1 mg/mL zymolyase 20T (Seikagaku Corp.). The 
suspension was gently shaken at 30 °C for 60 to 90 minutes to 
generate protoplasts. The resulting protoplasts were 
collected by filtration and centrif ugation, washed with the 
SUTC buffer, and resuspended in SUTC to prepare a protoplast 
suspension . 

To 100 ]iL of the protoplast suspension, 10 \iL of each 
DNA solution (pCB-Ste or pCB-Sts) was added. After each was 
allowed to stand on ice for 5 minutes, 400 yL of the PEG 
solution was added, and the whole was allowed to stand on 
ice for 20 minutes. After the resulting protoplast 
suspension was washed with the SUTC buffer, the minimal 
medium containing 0.5 mol/L sucrose was overlaid with the 
protoplast suspension together with soft agar. The plate 
was incubated at 28 °C for 5 days, and then grown colonies 
were transferred on the minimal medium, to obtain 
transf ormants as colonies grown thereon. 
[0104] 

(4) Comparison of endoglucanase activities of wild-type 
STCE1 and N-terminus-modif ied STCE1 in culture supernatants 

The resulting transf ormants were cultured in 
accordance with the method described in W098/11239, to 
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obtain culture supernatants . The culture supernatants were 
subjected to SDS-PAGE, to select culture supernatants in 
which the STCE1 band of approximately 45 kDa was clearly 
detected. The endoglucanase activities in the culture 
supernatants at pHIO in the absence and presence of LAS were 
measured, and the degrees of a reduction in the activity in 
the presence of LAS were compared. The results are shown in 
Table 6. In Table 6, the endoglucanase activities are 
represented by percentages when each endoglucanase activity 
at pHIO is regarded as 100. 
[0105] 
Table 6 
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[0106] 

As a result, on comparison of the endoglucanase 
activities in the presence of LAS, it was found that the 
endoglucanase activity of N-terminus-modif ied STCE1 was 
approximately 1.8 times higher than that of wild-type STCE1. 
Further, the endoglucanase activity of Trichoderma viride 
strain 2 was not detected, and thus it was found that the 
endoglucanase activities were derived from the recombinant 
enzymes expressed by the incorporated expression vectors. 
[0107] 

(5) Analysis of N-terminal amino acid sequences of wild-type 
STCE1 and N-terminus-modif ied STCE1 

From each culture supernatant of the above 
transf ormants, a solution thereof containing 0.05 mol/L 
Tris-HCl (pH8.0)/l mol/L sodium sulfate was prepared, and 
adsorbed to an equal bed volume (BV) of TOYOPEARL Butyl-650S 
(Tosoh Corporation) . The carriers were washed with three 
BVs of 0.05 mol/L Tris-HCl (pH8.0)/l mol/L sodium sulfate to 
remove non-adsorption components. The elution was carried 
out with three BVs of 0.05 mol/L Tris-HCl (pH8.0)/0.5 mol/L 
sodium sulfate, and, the eluted solution was concentrated by 
Pellicon XL (cut 5000; Millipore) and Ultrafree-15 (cut 
5000; Millipore) . From the obtaining concentrate, a 
solution thereof containing 0.1 mol/L sodium phosphate 
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(pH5.8)/l mol/L ammonium sulfate was prepared. The 
resulting solution was subjected to a hydrophobic 
chromatography using Resource PHE (6 mL; Amersham 
Bioscience), to recover a non-adsorption fraction. The non- 
adsorption fraction was concentrated, and desalted by PD-10 

(Amersham Bioscience) , and, from the fraction, a solution 
thereof containing 0.05 mol/L Tris-HCl (pH7.5) solution was 
prepared. The solution was subjected to Resource Q (6 mL; 
Amersham Bioscience), to recover a non-adsorption fraction. 
The non-adsorption fraction was desalted and, from the 
fraction, a solution thereof containing 0.05 mol/L sodium 
acetate (pH5.0) solution was prepared. The solution was 
subjected to Resource S (6 mL; Amersham Bioscience), to 
recover a non-adsorption fraction. The non-adsorption 
fraction was concentrated and subjected to a gel 
chromatography using Superdex 75pg (16*60 mm; Amersham 
Bioscience) to recover a fraction having a fractionated 
molecular weight of approximately 45000. Each fraction 

(derived from each culture supernatant of the above 
transformants) contained only the single band of 45 kDa. 

The purified fractions were subjected to SDS-PAGE, and 
the electrophoresed proteins were electrically transferred 
on a PVDF membrane. The STCE1 bands of approximately 4 5 kDa 
were excised from the blot, and subjected to Protein 
Sequencer Model 492 (Applied Biosystems) to analyze the 
amino acid sequences. 

As a result, it was confirmed that the sequence of 
wild-type STCE1 obtained from the pCB-Ste transformant was 
consistent with the DNA sequence (SEQ ID NO: 5) of STCE1 
derived from Staphy lot ri chum coccosporum IFO (present name: 
NBRC) 31817. 

However, the amino acid sequence of STCE1 obtained 
from the N-terminus-modif ied STCE1 transformant could not be 
determined in the above procedure. A Pfu pyroglutamate 
aminopeptidase (Takara Bio) was used to remove the modified 
N-terminus from N-terminus-modif ied STCE1, and then the 
amino acid was determined (SEQ NO: 4). 

The results were summarized in Table 7 to compare the 
N-terminal amino acids and the N-terminal amino acid 
sequences in wild-type STCE1 and N-terminus-modif ied STCE1. 
It was confirmed from the results that each STCE1 had the N- 
terminal amino acid sequence expected from the corresponding 
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constructed expression vector. 
[0108] 
Table 7 
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[0109] 

(6) Comparison of endoglucanase activities of purified wild- 
type STCE1 and purified N-terminus-modif ied STCE1 

The endoglucanase activities of purified wild-type 
STCE1 and purified N-terminus-substituted STCE1 obtained in 
Example B2(5) at pHIO in the absence and presence of LAS 
were measured, and the degrees of a reduction in the 
activity in the presence of LAS were compared. The results 
are shown in Table 8. In Table 8, the endoglucanase 
activities are represented by percentages when each 
endoglucanase activity at pHIO is regarded as 100. 

[0110] 
Table 8 
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[0111] 

As a result, on comparison of the endoglucanase 
activities in the presence of LAS, it was found that the 
endoglucanase activity of purified N-terminus-modif ied STCE1 
was approximately 1.9 times higher than that of purified 
wild-type STCE1. 
[0112] 

Example B3 : Change of N-terminal sequence added to STCE1, 
construction of vector, transformation of Trichoderma viride 
therewith, and evaluation of activities 
(1) Construction of expression vector for adding only 
pyroglutamic acid to the N-terminus of mature protein 
As an expression vector for STCE1 in which only 
pyroglutamic acid was added, the coding region of the mature 
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protein in which the signal sequence for secretion was 
deleted in the STCE1 gene was ligated with plasmid pCBl-M2 
obtained in Example Bl to construct pCB-Stmll. 

First, the coding region of the mature protein of the 
STCE1 gene was amplified by carrying out PCR using plasmid 
pUC118-STCEex (FERM BP-10127) as a template and STCE-TmNSma 
and STCE1-TCET22 I as primers. The resulting PCR product of 
approximately 1 kbp was digested with Smal and £coT22I, and 
ligated with pCBl-M2 previously digested with Smal and PstI 
to construct pCB-Stmll. Trichoderma viride transformed with 
vector pCB-Stmll constructed as above produces STCE1 having 
the N-terminal amino acid sequence of pQADGKSTR (SEQ ID NO: 
41) [i.e., the sequence in which one pyroglutamic acid (pQ) 
is added to the N-terminus of wild-type STCE1] . 
[0113] 

STCE-TmNSma: GGCCCGGGCTCAGGCCGATGGCAAGTCGACCCG (SEQ NO: 35) 
[0114] 

(2) Comparison of endoglucanase activity of pyroglutamic 
acid-added STCE1 

Trichoderma vlride strain 2 was transformed with pCB- 
Stmll in accordance with the procedure described in Example 
B2(3). The resulting transf ormants were cultured in 
accordance with W098/11239 to obtain culture supernatant s . 
The culture supernatants were subjected to SDS-PAGE, to 
select a culture supernatant in which the STCE1 band of 
approximately 45 kDa was clearly detected. The 
endoglucanase activity in the culture supernatant at pHIO in 
the absence and presence of LAS were measured, and the 
degree of a reduction in the activity in the presence of LAS 
was compared to that of wild-type STCE1 in Example B2(4). 
The results are shown in Table 9. In Table 9, the 
endoglucanase activities are represented by percentages when 
each endoglucanase activity at pHIO is regarded as 100. 

[0115] 
Table 9 
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As a result, on comparison of the endoglucanase 
activities in the presence of LAS, it was found that the 
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endoglucanase activity of pyroglutamic acid-added STCE1 was 
approximately 1.6 times higher than that of wild-type STCE1. 
[0116] 

Example B4 : Change of N-terminal sequence added to STCE1, 
construction of vector, transformation of Trichoderma viride 
therewith, and evaluation of activities 

(1) Construction of expression vector for adding pQ- 
containing 4 amino acids to the N-terminus of mature protein 

As an expression vector for STCE1 in which 4 amino 
acids containing pyroglutamic acid were added, the coding 
region of the mature protein in which the signal sequence 
for secretion was deleted in the STCE1 gene was ligated with 
plasmid pCBl-M2 obtained in Example Bl to construct pCB- 
Stml2. 

First, the coding region of the mature protein of the 
STCE1 gene was amplified by carrying out PCR using plasmid 
pUC118-STCEex ( FERM BP-10127) as a template and STCE-TmNSph- 
2 and STCE1-TCET22 I as primers. The resulting PCR product 
of approximately 1 kbp was digested with SphI and EcoT22l, 
and ligated with pCBl-M2 previously digested with SphI and 
PstI to construct pCB-Stml2. Trichoderma viride transformed 
with vector pCB-Stml2 constructed as above produces STCE1 
having the N-terminal amino acid sequence of pQSACADGKSTR 
(SEQ ID NO: 42) [i.e., the sequence in which a peptide 
consisting of 4 amino acids (the N-terminus is pyroglutamic 
acid) is added to the N-terminus of wild-type STCE1] . 
[0117] 

STCE-TmNSph-2: GGGCATGCGCCGATGGCAAGTCGACCCGC (SEQ NO: 36) 
[0118] 

(2) Comparison of endoglucanase activity of pQ-containing 4- 
amino-acids-added STCE1 

Trichoderma viride strain 2 was transformed with pCB- 
Stml2 in accordance with the procedure described in Example 
B2(3). The resulting transf ormants were cultured in 
accordance with W098/11239 to obtain culture supernatants . 
The culture supernatants were subjected to SDS-PAGE, to 
select a culture supernatant in which the STCE1 band of 
approximately 45 kDa was clearly detected. The 
endoglucanase activity in the culture supernatant at pHIO in 
the absence and presence of LAS were measured, and the 
degree of a reduction in the activity in the presence of LAS 
was compared to that of wild-type STCE1 in Example B2(4). 
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The results are shown in Table 10. In Table 10, the 

endoglucanase activities are represented by percentages when 

each endoglucanase activity at pHIO is regarded as 100. 

[0119] 
Table 10 
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As a result, on comparison of the endoglucanase 
activities in the presence of LAS, it was found that the 
endoglucanase activity of pQ-containing 4-amino-acids-added 
STCE1 (pQ-containing 4-AAs-added STCE1) was approximately 
1.5 times higher than that of wild-type STCE1. 



INDUSTRIAL APPLICABILITY 
[0120] 

The protein of the present invention exhibits a small 
reduction in an endoglucanase activity in the presence of a 
surfactant, in comparison with wild-type cellulases, and 
thus is useful as a component of a detergent for clothing. 

Although the present invention has been described with 
reference to specific embodiments, various changes and 
modifications obvious to those skilled in the art are 
possible without departing from the scope of the appended 
claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0121] 

Figure 1 shows the results of comparison among the 
amino acid sequence of endoglucanase STCE1 [signal peptide 
(SEQ ID NO: 43) and mature protein (SEQ ID NO: 44)] and the 
amino acid sequences of known endoglucanases belonging to 
family 45, NCE4 [signal peptide (SEQ ID NO: 45) and mature 
protein (SEQ ID NO: 46)] and NCE5 [signal peptide (SEQ ID 
NO: 47) and mature protein (SEQ ID NO: 48)], with respect to 
the amino acid sequences of N-terminal half. 

Figure 2 shows the results of the comparison described 
in Figure 1, with respect to the amino acid sequences of C- 
terminal half. 
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FREE TEXT IN SEQUENCE LISTING 
[0122] 

Features of "Artificial Sequence" are described in the 
numeric identifier <223> in the Sequence Listing. Each of 
the nucleotide sequences of SEQ ID NOS : 6-10, 12-13, 15-18, 
21-26, 28, 30-32, and 35-36 is primer MNC-02, primer MNC-03, 
primer MKA-05, primer MKA-06, pMKDOl, primer pMN-Bam, pJDOl, 
primer NCE4-Ne, primer NCE4-Ce, primer NCE4-Ns, primer NCE4- 
Cs, primer CBn-Stu, primer CBc-Xho, primer CBl-SmSph, primer 
CBl-Bam, primer CBl-Pst, pCBl-M2, pCBl-M2, primer STCE1- 
TNERV, primer STCE1-TCET22I , primer STCEl-TmNSph, primer 
STCEl-TmNSma, and primer STCEl-TmNSph-2 . Each of the amino 
acid sequences of SEQ ID NOS: 11, 14, 27, and 29 is pMKDOl, 
pJDOl, pCBl-M2, and pCBl-M2. 



